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1. Introduction 

The purpose of this article is to review some of the recent work on the operator 

(1.1) CHxVOn = " i>n+l + XV(T n x)tp n 

on £ 2 (Z) , where T : X — > X is an ergodic transformation on (X, v) and V is a real- valued function. A is 
a real parameter called coupling constant. Typically, X = T d = (M/Z) d with Lebesgue measure, and V 
will be a trigonometric polynomial or analytic. We shall focus on the papers jGolSchl] and |GolSch2] by 
the authors, as well as other work which was obtained jointly with Jean Bourgain. Our goal is to explain 
some of the methods and results from these references. Some of the material in this paper has not appeared 
00 ■ elsewhere in print 1 . 
I/"") \ Even more specifically, we will be mostly concerned with the distribution of the eigenvalues of (f 1 . 1 1> - both 

on finite intervals [— N, N] as well as in the limit N — ► oo (and not so much with Anderson localization). In 
more technical terms, we are referring here to the integrated density of states or IDS. It is a nondecreasing, 
deterministic function N(E), and it is related to the Lyapunov exponent L(E) by means of the Thouless 
formula 

(1.2) L(E) = f \og\E- E'\N(dE') 

J: 



O 



(N 
> 



O 
in 
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As usual, we set 

1 



rrt ! L(E)= lim - / \og\\M n {x,E)\\v{dx) 

n— >oo n J x 

where M n are the transfer matrices 

l 

M n (x,E) = 11 



V(T k x)~E -1 
1 



k=n 

of i.e., the column vectors of M n are a fundamental system of the equation H x ip = Eip. 

One of the most basic problems related to concerns the positivity of the Lyapunov exponent L(E). 
More specifically, there are dynamical systems for which L(E) exhibits a "phase transition" from the region 
L(E) > to that where L(E) — when A varies, and there are systems for which L(E) > for all values of 
A 7^ 0. For instance, one expects that for the skew-shift T : T 2 — > T 2 , T(x, y) = (x + u), y + x), the Lyapunov 
exponent is positive for all A ^ 0. A rigorous description of phase transitions for L(E) or the proof of the 
absence of such transitions is a primary objective in the study of 

In the study of (|1 . 1|) much attention has traditionally been given to the fine properties of the distribution 
of the eigenvalues of (|l.lf> . i.e., the IDS. As observed by Avron and Simon AvrSim], and by Craig and 
Simon CraSim] the Thouless formula implies that the IDS is log-Holder continuous. In IGolSchl] it was 
shown that for positive Lyapunov exponents the IDS is Holder continuous, and their argument was refined 
in |Bou2| to show that the Holder exponent remains bounded below by a positive constant when the Lyapunov 
exponent approaches zero. For the almost Mathieu potential V(ff) = cos(0), Sinai [Sinlj and Bourgain Boul 
observed perturbatively (i.e., when A is large), that the IDS is Holder \—z continuous for any e > 0. Moreover, 

The first author was partially supported by an NSERC grant. The second author was partially supported by the NSF, 
DMS-0300081, and a Sloan fellowship. The authors dedicate this article to Yakov Grigorievich Sinai on the occasion of his 70th 
birthday. 

It is not our intention to provide a systematic and detailed review of the many developments that have taken place during 
the last five to ten years in this vast area and we apologize to those authors who have been involved in the study of these models 
but are not mentioned here. 
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it follows from Sinai's work that this is optimal (see also Puig |Pui2| L In the recent paper GolSch2 , the 
authors addressed the problem of determining the exact Holder exponent non-perturbatively assuming only 
that L(E) > 0. Furthermore, GolSch2 treats general potentials (trigonometric polynomials or analytic 
functions) and it is shown that the IDS is absolutely continuous. 

An interesting open problem is to understand the behavior of the IDS under the phase transition 

L(E) > — ► L(E) = 

Recall that for the almost Mathieu case it is known due to the work by Last Las and by Gordon, Jito- 
mirskaya, Last, and Simon GorJitLasSim , that the Lebesque measure of the spectrum decreases to zero 
when the coupling constant approaches the critical value A = 2. 

We now set out to describe some of the basic tools developed in the references |BouGol| . [GolSchl] . 
IBouGolSch) . |GolSch2| . 

2. Large deviation theorems 
It was shown by Fiirstenberg and Kesten jFurKesj that 

lim -\og\\M n {x,E)\\ =L(E) 

n — *oo fi 

for a.e. x G X. To quantify this convergence, set 

L n (E) = - f \og\\M n ( Xl E)\\v{dx). 
n Jx 

For certain types of dynamics and V the following large deviation theorems (or LDTs) are known to hold for 
some choice of < er, r < 1 : 

(2.1) is({xeX : |log||M„(x,S)|| - nL n (E)\ > n 1 "' 7 }) < Cexp(-n T ). 

For dcfinitcncss, let X = T, dv = dx the Lebesgue measure, and Tx = x + lu modZ with an irrational lu. 
Moreover, we shall assume that V : T — > K is analytic and nonconstant. The LDTs are known to hold in 
this case. 

To motivate (|2.1|) . consider first a commutative model case, namely 

i 

u(x) = log \e(x) — e{kuo)\ 

k=l 

with u> = 2 and e(x) = e 2mx . Then u(x) = log \ e(xq) — 1| and J T u{x) dx — so that for A < 

(2.2) mes ({x G T : u(e(x)) < A}) = mes ({ieT : \e(x) - 1| < e A }) 

which is of size e A (here mes stands for Lebesgue measure). In this model case, u(x + 1/q) = u(x). 

Returning to u(x) — log |jM„(x, E)\\, this exact invariance needs to be replaced by the almost invariance 

(2.3) sup \u(x) — u(x + koj)\ < Ck for any k > 1. 

x<ET 

The logarithm in our model case is a reasonable choice because of Riesz's representation theorem for sub- 
harmonic 2 functions (see Levin |Lev| 1 applied to the function u(z) = log \\M n (z, E)\\ : 

Let u(z) be a subharmonic function on some domain Q C C. Then there exist a positive measure /i (called 
the Riesz measure), finite on all compact sub-domains Q' C f2 so that 

(2.4) «(«)=/ log\z-C\vt(d£) + h(z) VzeO' 

Jn' 

where h is harmonic on fi'. 

If u € C 2 (f2) then subharmonicity is the same as Au > and /i = Au. Moreover, in this case (|2.4() is an 
instance of Green's formula. The general case follows by taking limits. In what follows, we will also make 
use of the following bounds on fi and h: If (|2.4|) holds and |it| <ifon Of, then 

(2.5) fx(n") + \\h\\ Laa{Q , f) <K 

2 This is defined to mean upper semicontinuous and satisfying the sub-mean value property. 
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where 57" C 0' (as a compact sub-domain). 

2.1. Cartan estimates. One way of deriving (|2.1() from (|2.3(l and (|2.4() is via Cartan's estimate for sub- 
harmonic functions: 

Given a finite positive measure fi on C and H > f/iere is a (possibly infinite) collection of disks {D(zj, rj)} 
so that? 

J2 r j<CH and u(z)> -C\\iM\\logH Vz £ C\ \jD{z j ,r j ) 

j 3 

The most obvious example here is of course /i = uSq. More generally, for the measures A* = X^fe ^z k with 
some finite collection {zk}k C C of (not necessarily distinct) points, Cartan's theorem becomes a statement 
about polynomials which already captures all the main features. In order to prove (|2.1(l we will impose the 
Diophantinc condition 

(2.11) \\nu>\\ > — Vn > 1 

n a 

where c = c(ui) > and 4 2 > a > 1. Here || • || measures the distance to the nearest integer. 
Proof of J2J). Write 

u{z)=log\\M n (z,E)\\= f log\z-C\fi(dC)+h(z) 



on some open rectangle R which contains 5 [0, 1]. Then < u{z) < n and thus n(R') < n where R' C R is 
a slightly smaller rectangle, as well as ||/i||l°°(.r') < n, see (12.511 . Fix a small (5 > and take n large. Then 
there is a disk Dq = D(xo,n~ 2S ) with the property that /z(-Do) < n 1_2A . Write 

u(z) = ui{z) + u 2 (z) = [ log \z - C| /i(dC) + / log |* - CI MO 
Set Di = D(a;o,n- 3 ' 5 ). Then 

\u 2 (z) - u 2 {z')\ < n 1 - 5 yz,z / £D 1 
Cartan's theorem applied to u\(z) yields disks {D(zj, rj)}j with Ylj rj < exp(— 2n s ) and so that 

Mz)>-n^ 5 Vz £ C\\J Diz^rj) 

j 

Since also u\ < on Di as well as \h(z) — h(z')\ < n|z — z'|, it follows that 

(2.12) ^n 1 ' 5 Vz,z'e D 1 \\jD(z j ,r j ) 

j 

^From the Diophantine property (with a < 2), for any x,x' £ T there are positive integers k,k' < n 45 such 
that 

x + fca;, x' + k'uj £ D\ mod Z 

In order to avoid the Cartan disks [Jj D(zj,rj) we need to remove a set B C T of measure < exp(— n* 5 ). 
Then from the almost invariance (|2.3|l . for any i,i'eT\8, 

- u(a/)| < n AS + n 1 - 5 < n 1 ' 5 

This implies (|2.1() with er = r = <5 and we are done. □ 

This proof generalizes to other types of dynamics as well. For example, let Tx = x + uj modZ , d > 2, 
be a higher-dimensional shift . 



Exact numerical values of the constants C are known, see Levin. Also, one can replace ^ Tj with rj for any s > 0, 
which implies that {m = — oo} has Hausdorff dimension zero. 
4 The upper bound of 2 here is more of a cosmetic nature 
^recall that u(x) = log ||M n (x, E)\\ is a one-periodic function 
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Definition 2.1. Let < H < 1. For any subset B C C we say that B £ Car^iJ) if B C (J. D(zj,rj) with 
(2.13) ^Vj^Co-H". 

If d is a positive integer greater than one and B C C d we define inductively that B £ Cax d (H) if there exists 
some Bo £ Card-i(H) so that 

B = {(zi,Z2, ...,Zd) ■ (Z2, ■ ■ ■ , Zd) S Bo or z\£ B(z2, ...,Zd) for some B(z 2 , . . . , z d ) £ Cari(iT)}. 

We refer to the sets in Caid(H) for any d and H collectively as Cartan sets. 

Using the following theorem from [QolSchl] the previous proof of l|2.1Jl easily generalizes. We state the 
case d = 2, with d > 2 being similar (see also jSchj ). 



Theorem 2.2. Let u be a continuous function on _D(0, 2) x D(0, 2) C C 2 so that \u\ < 1. Suppose further 
that 

Z\ i— > u(zi, z-i) is subharmonic for each z-i £ D(0, 2) 
Z2 i— > u(zi, z 2 ) is subharmonic for each z% £ D(0, 2). 

Fix some 7 £ (0, |). Given r £ (0, 1) there exists a polydisk H = D{xoi,r l ~~*) x D(xo 2 , r ) C D(0, 1) x -D(0, 1) 
with xo±, X02 S [— 1, 1] and a set B £ Car2(i?) so that 

(2.14) \u{z 1 ,Z2)-u{z[,z l 2)\ < C 7 r 1-27 log- /or aZJ (21,^2), (4,4) 6 H\B 

(2.15) = cxp^-r^ 7 ). 

The point of this theorem is that it takes the place of (|2.12|) in the previous proof. 
2.2. Fourier series. An alternative approach to (|2.1(l is based on Fourier series. Indeed, one writes 

u(x) — (u) = ^ u(x + ju>) — (u) + 0(k) = u(v)e(xv) — e(jvu)) + 0(k) 
j=l v^O j=i 

Then one has that 

k 



ItX>0M| ^min^fe-^IHI" 1 ) 

for all v > 1. Also, it follows from l|2.4|l that \u(v)\ < which in turn implies that 

1 fe . 

\u{x) ~ (u)\ < -V V &{v)e(u(x + ku)) + nkr 1 min(l,Jb- 1 ||iw||- 1 ) 



J'=l |v|>if 0<|^|<if 

Clearly, 



k 

2=1>|>K 



I > u(u)e(u(x + kui)) 

L 2 

X 

Taking K = e n it follows from the Diophantine condition that 

^ nl^] -1 min(l, /c 1 1 1 z^Cij' 1 1 1 ) < nfc~^ logi^T < n 1+T k~^ 
Q<\u\<K 

Choosing r > small and k — ni , say, yields Il2.1|) . 

For applications related to the study of fine properties of the IDS it turns out to be important to obtain 
sharp versions of J23J- The commutative model example suggests that the optimal relation is0<<7 = r<l. 
This is indeed the case, see Section 0] below. 

This proof also generalized to higher-dimensional tori, see |BouGoH as well as |Bou2| . S. Klein |Klej has 
removed the analyticity assumption and obtained estimates as in 1)2. 1|) for the Gevrey classes by means of 
Fourier methods (using higher-order Fejer kernels) and suitable truncations of the Fourier series. 
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2.3. Other dynamics. The arguments which we have just presented do not depend on positive Lyapunov 
exponents. The situation is very different for the skew-shift defined by T(x, y) — (x + y,y + u>) on T 2 . The 
point here is that T n (x,y) = (x + ny + n(n — l)u>/2,y + nu>) modulo Z 2 . Due to the presence of ny in 
the first coordinate we are faced with the problem that the Riesz mass of the subharmonic extensions of 
log \\M n (x, y, E)\\ is now of size n 2 , at least if we consider the extensions to a fixed neighborhood 6 of T 2 C C 2 . 
Indeed, in this case |j M n (x, y + ie, E)\\ behaves like a product IX)=ri eJe which is of size e Cen . Thus, in this 
context neither of the two methods discussed so far lead to a bound of the form l|2.1[l for the skew-shift (the 
problem is that these methods only gain a factor of n~ s over the Riesz mass as far as the deviations are 
concerned - here we would therefore get n 2 ~ s for the deviations which is useless). 

In BouGolSch a LDT is proved for the skew-shift but for large disorders. This refers to the fact that 
the potential has to be of the form XV for large A. The method in |BouGolSchj proceeds by induction over 
the scale n, and the first stage requires large A. The inductive step is realized by means of the avalanche 
principle (see the following section) which is a purely deterministic statement about products of 2 x 2 matrices. 
Moreover, the analytic difficulty of having n 2 Riesz masses is circumvented by the following splitting lemma 
from BouGolSch] (see also |Bou2| ): 

Lemma 2.3. Suppose u is subharmonic on A p (a p-neighborhood off), with sup^^ \u\ < N. Furthermore, 
assume that u = uq + u\, where 

(2.31) ||uq - («o)IU»(T) < £o and ||ui||li(T) < £i- 
Then for some constant C p depending only on p, 

(2.32) IMIbmo(t) < C p (e o + 0V^)- 

To apply this lemma one uses the avalanche principle to generate the splitting into uq and u\ with an 
exponentially small e\ ~ e~ n . This allows for Riesz masses N which are polynomially large, say N = n c as 
is the case for the skew-shift. 

Finally, and in a very different vein, we would like to mention that LDTs have also been established for 
the doubling dynamics x i— ► 2x mod 1 and for very small disorder A > in [BouSch] . The latter is needed 
in order to apply the Figotin-Pastur formula, see |FigPas| . 

Generally speaking, it remains an open problem to prove LDTs for Schrodinger cocylces with potentials 
of the type XV(T n x) for general classes of dynamics T, disorder A, as well as wider classes of potentials V. 

3. Positive Lyapunov exponents 

As we have already mentioned of the central problem concerning ll.lfl is to decided whether or not 
L(E) > 0. In the case of random i.i.d. potentials this was established by Furstenberg |Fur| . In case of quasi- 
periodic potentials, the well-known Herman's method |Her| establishes this positivity for large disorders 
provided the potential function V is a trigonometric polynomial. Sorets and Spencer extended this to 
analytic V . Here we present a different approach, which is based on the following avalanche principle (AP) 
from jGolSchlj (for this version which does not assume that the matrices belong to SX(2,M) see _Gol Sch2] ). 

, A n be a sequence of 2 x 2-matrices whose determinants satisfy 

max I det A A < 1. 

l<j<n J 



min ||.Aj|| > p > n and 

l<j<n 

max [log||A i+1 || +log||A 7 -|| - log || A j+1 A } ||] < -log^t. 

l<j<n I 



^Shrinking to a neighborhood to size 0(n 1 ) reduces the Riesz-mass to < n, but then there is a price to pay for the small 
diameter of the neighborhood. 



Proposition 3.1. Let A±, . . . 

(3.1) 

Suppose that 



(3.2) 
(3.3) 
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Then 
(3.4) 



log IK 



n— 1 n— 1 

Ax || + J2 log 11^ || log || A j+ i Ai 

J=2 j=l 



<c n - 



with some absolute constant C . 



The meaning of (|3.3|l is that adjacent matrices do not cancel pairwise, whereas (|3.2|) insures that each 
matrix is sufficiently large. The conclusion is that the entire product has to be large with the very precise 
difference bound from (|3.4(1 . 

As an application of this principle, let us study the rate of convergence of Ln(E) to L(E) for the opera- 
tor (JO}. We will assume that L(E) > 7 > for some E € R. Furthermore, we shall assume that there is 
a LDT of the form (|2.1|) . We shall make no other assumptions on the dynamics T. Given a large integer n, 
define k = C (log A)~ (here r is as in ^.lt l. Then 

1 

M N (x, E) = M k „(T k ' +ek x, E)M k ,(T ek x, E) J[ M k {T^ k x, E) 

3=1 

where k/2 < k',k" < k. In view of (|2.1|) there exists a set B C X of measure < A^ -10 , say, so that for 
all x € X \ £> we can apply the AP to this product. This requires making Co = Co (7) large. In order to 
check Q3.3H one uses the fact that Ljsf(E) — * L{E) as n — > 00. We can now average l|3.4|l over x G A which 
yields 



(3.5) 



\L N {E)~2L 2k {E) + L k {E)\ 



A 



Applying the same reasoning with and the same choice of k yields 



\L 2N (E)~2L 2k (E) + L k (E)\ < 

and thus also 

\L 2N {E)-L N (E)\ 
Passing to the limit therefore implies that 



(logA)- 



N 



< (logA)T 



< L N (E)-L(E) < 



N 
(log NY- 



N 



This can be further improved to 

< L N (E)-L(E) < N- 1 

Moreover, this convergence holds uniformly in the energy for all intervals 1 cl for which infgg/ L(E) > 
7 > 0, see |GolSchl| . 

The AP can also be used to establish positive Lyapunov exponents. Indeed, let V : T d — > R be an analytic 
potential and T : T d — > T d be ergodic. The matrix 

\XV(T^x)~E -l" 



M n (x,X,E) = Yl 



1 







denotes the transfer matrix of the equation where the potential is now written as W(T n x). As before, 

L n {X,E) = - [ log\\M n (x,X,E)\\dx 
L n (X,E) exists. Finally, let S(X,E) be a number satisfying 



and L(X, E) = lim„ 

(3.6) S(X,E) 
Then the following is shown in [GolSchl] : If the weak large deviation theorem, ( with some a > 0) 



sup sup — log || M n (x, A, E)\\ 
n>l X £T d n 



(3.7) 



/ -log\\MJx,X,E)\\ -L n (X,E) dx < CS(X,E) n - a 
J T d n 
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holds for all n = 1, 2, . . ., then 



inf L(X, E) > for all A > A (V, d, a) 

E 



This is proved inductively, the main step being described by the following lemma: 

Lemma 3.2. Suppose that Ij3.7|l holds for all n with some choice of a > 0. Then there exists a positive 
integer £q — £q(<j) such that if 

(3.8) L t > S(E,\)e-°/ 4 and L e (E, A) - L 2 e(E, A) < Ll \' X > 

8 

for some £ > £ Q , then L(E, A) > L t {E, A)/2. 

This lemma does not require any further information about T or V other than (|3.7|l . On the other hand, 
to insure that the conditions of this lemma are met, one chooses A large using analyticity of V. It would be 
interesting to apply this method to non-analytic V which satisfy some natural non-degeneracy assumption 
(in particular, one would need to establish JjUJ). 

K. Bjerklov |Bje| recently used this result to prove positive exponents for some interval of energies for his 
class of potentials which exhibit mixed behavior (i.e., both zero and positive exponents). 

4. Regularity of the IDS 
Let us now assume that we have the following sharp LDTs for u n (x) = log \\M n (x, E)\\ : 

(4.1) v({x S X : \u n {x) - nL n (E)\ > nS}) < exp(-c(<5)n + C(log n) A ^j . 

where c(S) > and C, A are some constants . For the case of X = T and the shift by a Diophantine 8 u>, such 
estimates were obtained in [GolSchl] with c(<5) = 5 2 . Assuming positive Lyapunov exponents L(E) > 7 one 
can further show that c(S) = c(j)5 which shows that one can take 0<cr = r<lin H2.1(l (the latter is done 
via the AP). 

Then by the arguments of the previous section we obtain the following stronger version of 13.5|l 

\L N {E) - 2L 2k (E) + L k (E)\ < ^ 

This is due to the fact that we can break up Mm into products of matrices of size k — [Clog TV] when 
applying the AP. Therefore, 

\L(E) - L(E')\ < \L N (E) - L(E)\ + \L N (E') - L{E')\ 

+ \L N (E) - 2L 2k (E) + L k (E)\ + \L N (E') - 2L 2k (E') + L k (E')\ 
+ 2\L 2k (E) - L 2k (E')\ + \L k (E) - L k (E')\ 
< N^ 1 log A + \E- E'\exp(Ck) < A -1 log A + N B \E - E'\ 

Consequently, choosing A = \E — E'\~ a for some < a < 1, we deduce that L(E) and therefore also that 
the IDS (by I jl.2| l1 are Holder continuous. 

This approach to the regularity of L(E) was subsequently modified by other authors. For example, 
Bourgain and Jitomirskaya |Boujit| use this very approach from |GolSchl| to show that L(E,u) is jointly 
continuous away from rational u> for the case of the shift on T. Their argument is based on sharp large 
deviation theorems, the avalanche principle, as well as the difference relation (|3.5I) . 



^We can consider these estimates as a "black box" without specifying T or V further 
8 This means that \\nuj\\ > ,, c ^ ,„ with a > 1 

11 11 — n(log n) u 
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5. Eigenvalues, localization, and the zeros of f N (z,ui,E) in z 

In GolSch2 a different approach to the regularity of the IDS was developed which allows us to obtain a 
lower bound on the Holder exponent by non-perturbative methods. In fact, for the almost Mathieu model 
with A > 2 it is shown that the Holder regularity is \ — e for any e > 0. Moreover, off a set of Hausdorff 
dimension zero the IDS is 1 — e Holder regular for any e > (the latter requires the removal of a set of to 
of measure zero). Similar results are obtained for other potentials assuming positive Lyapunov exponents as 
well as the strong Diophantine condition (|2.11|) . 

It is well-known that in the case of the almost Mathieu model the exponent \ cannot be improved. 
For large disorders this was observed by Sinai |Sinl| . whereas Puig [Puilj . |Pui2| has obtained this non- 
perturbatively. On the other hand, Bourgain [Boulj has shown for the almost Mathieu operator that the 
Holder exponent is no worse than i — e for very large disorders. 

In the almost Mathieu case the optimality of i is intimately connected with the Cantor structure of the 
spectrum. In fact, it is at the gap edges that one encounters loss of the Lipschitz behavior. For more general 
potentials as in GolSch2] this connection is not clear and it would certainly be of great interest to elucidate 
the connection between gaps and the regularity of the IDS further. 

We now set out to describe some of the basic ingredients of |G*olSch2 . 



5.1. Large deviation theorems for the entries. Recall that 

f n (x,E) -f n _x(Tx,E) 



where 



(5.11) f n (x,E) =det 



M n {x,E) = 
v(l,x) - E 



f n -i{x,E) -f n _ 2 (Tx,E) 



1 



-1 




. . 

v(2,x)-E -1 0. 

-1 v(3,x)-E -10 











-1 v(n, x) — E 



It is customary to denote the matrix on the right-hand side as iJr l ra ](a;) — E so that one as f n (x,E) = 
det(H[ l n ] (x) — E). The following result is proved in IGolSch2] . Henceforth, we shall assume that the 
Lyapunov exponents are positive as well as that tu is strongly Diophantine. 

Lemma 5.1. There exist constants A and C depending on oj and the potential V , so that for every n > 1 



(5.12) 
(5.13) 

In particular, for every n > 1, 



log | det(-ff[i : „j (x, lu) - E)\dx -n L n (uj, E) 



log | det(i?H„] (x, u) -E)\ || B mo < C(logn)' 



< C 



(5.14) 



GT| \log\det(H [1M (x,u)-E)\-nL n (u,E)\ >H 



< Cexp 



cH 



(logn) 



for any H > (logn)" 4 . Moreover, the set on the left-hand side is contained in at most < n intervals each of 
which does not exceed the bound stated in (|5.14() in length. 

The point here is of course that the entries of M n satisfy the same bound as M n itself. One basic step in 
the proof of this lemma is to show that 



(5.15) 



log \f n {x, E)\dx> nL n (E) - Cn a 



with some a < 1. Then combine this with a uniform upper bound of the form (see jGolSchl] ) 



sup log !/„(£,£) | <suplog||M n (z,£;)|| <nL n {E)+Cn° 
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to conclude that 

log |/„(a;, £)| = / \»z\fJ.r.E)\ ,1-r - »,(.,•) Willi \\u L i ; c ,r 
Thus, by Lemma T2. 31 we conclude that 



log !/„(*, £)l I 



BMO 



< „(l+o-)/2 



which beats the trivial bound of n. We used here that the Riesz mass of the subharmonic extension of 
log |/ n (x, E)\ to a neighborhood of the circle is < n. 

The John-Nirenberg inequality therefore implies the large deviation theorem 



(5.16) 



mes 



({xeT : \ \og\f n (x,E)\-nL n (E)\>n 1 - 5 })<exp(-n s ) 



provided 5 > is sufficiently small. 

This is of course considerably weaker than Lemma l5.ll To improve on it, we apply the avalanche principle 
to the product 



1 




M n (x,w,E) 



1 




fn(x,E) 




Note that the AP does not require SX(2,R) matrices, but rather l|3.1[) which holds here. Write n = l\ + 
(m - 2)t + t m where I x (logn) c °, l x x l n x t, and set s x = 0, Sj- = 4 + (j - 2)^ for 2 < j < m. Hence, 



/„(z,£) 




= n a m 



where Aj(x) — Mg(x + sjui), for 2 < j < m—1, and Ai(x) = Mi 1 (x) 



1 




A m (x) 



1 




Mt m (x+s m w). 



-100 



One checks easily from (I5.16f) that the conditions l|3.2[) and (|3.3[) hold up to a set of x of measure < 
say. Hence, by Proposition ^. II 

m— 1 m— 1 / -. 

(5.17) log|/ n (a,2J)| =- £ logK-(x)|| + ]T log || + O - 

We now invoke the following LDT for sums of shifts of subharmonic functions, see Theorem 3.8 in [GolSchl] : 
For any subharmonic function u on a neighborhood of T with bounded Riesz mass and harmonic part 



(5.18) 



mes 



€ T | | y ] u(x — ku) — n(u) | > Sn < exp(— cSn + r„) 



fc=i 



where r„ < (logn)' 4 . The sums in (|5.17|l involve shifts by £uj rather than u. In order to overcome this, note 
that we can take t n > 2£, say. Repeating the argument that lead to (|5.17l) I — 1 times with the length of A\ 
increasing by one and that of A m decreasing by one, respectively, at each step leads to 

log \f n (x, u, E)\ = -- Y lo S H^J'^ + fcw )H + J Y Y lo § I' ( A i+i A i)( x + Mil + j Y Uk ^ + ° 

k=0 j=2 k=0 j=2 

(m-l)e-l 1 (m-l)^-l 

(5.19) = -~ V log\\M e (x + + 



fc=0 



_^ (m-1^-1 (m-l^-l £-1 

- ^ Iog||AC e (a!+ja;)|| + j ^ log||A/ 2 *(z +ja;)|| + - £u fe ( 



£ \ n 

fc=0 



The functions it^ compensate for omitting the terms j = 1 and j = m — 1 when summing log ||vlj_|_iAj||. 
They are subharmonic, with Riesz mass and harmonic part bounded by (logn) Co . Estimating the sums 
involving Mi and by means of (|5.18() . and the sums involving Uk directly by means of Cartan's bound 
shows that there exists BcTof measure < exp(— (log N) c °), so that for all x € T \ B, 



Thus, 



)ag\f n (x,w,E)\- Qog\f n (x,L>,E)\) < (\ogn) 2C °. 

log\f n (x,(J,E)\ = U (x) + Ui(x) , 



10 



MICHAEL GOLDSTEIN AND WILHELM SCHLAG 



where 

IK - <io g |/ n (-,£OI>IU-m ^ ( lo g«) 2Co > 

and 



\\ Ul -(\og\f n (;E)\)\\ Llm < ||log|/ n (-,E)| || i2(T) Vmes(B) 
Applying Lemma 12.31 one now obtains that 



< n ■ \/mes(S) < exp ( — -(logn) c ° 



log !/„(*, u;, .E)l < C ( (logn) 2Co+1 + J,i • cxp ( -i(logn) c o 

BMO(T) \ V \ 4 

< C(logn) 2Co+1 , 

as claimed. 

It therefore remains to obtain (|5.15|l . For this, as well as other details of Lemma f5. II we refer the reader 
to Section 2 of |GolSch2| . 

We remark that (|5.19|l illustrates how the AP allows us to write the determinants f n (x,E) as rational 
functions which are composed of products of shifts of "short" functions (more precisely, of functions with 
small Riesz mass). This cannot be done for all x (because of the bad sets in the LDTs) and also leads to 
certain small errors. This approximate factorization is one of the basic tools of GolSch2 . 

5.2. Uniform upper bounds and zeros of determinants. The following result based on Lemma [5.11 
improves on these uniform upper bounds. Uniform upper bounds on the norm of the monodromy matrices 
in terms of L(E) were found in |BouGol| . |GolSchl] . The (log TV)" 4 error obtained in |GolSch2] (rather than 
N" , say, as in |BouGol| and |GolSchl| 1 is crucial for the study of the fine properties of the integrated density 
of states. 

Lemma 5.2. Let u> be as in (|2.11() . Assume L[uj, E) > 0. Then for all large integers N , 

sup log\\M N (x,u,E)\\ < NL n (uj,E) + C(logN) A , 

for some constants C and A. 

Proof. We only consider x and suppress u and E from most of the notation. Take I X (}ogN) A . Write 
N = (n — 1)£ + r, £ < r < 2£ and correspondingly 

o 

M N (x) = M r (x + (n - 1)£lu) JJ Mi(x + j£u) . 

j=n-2 

The avalanche principle and the LDT (|4.1|l imply that for every small y there exists B y C T so that 
mes (B y ) < N~ wo and such that for x € [0, 1] \ B y , 

n— 3 n—2 

log||Mjv(a; + iy)|| = ^ log || M 2e (x + jiuj + iy)\\ - ^ log \\M e (x + j£w + iy)\\ 

3=0 j=l 

(5.21) + log \\M r (x + (n - l)£uj)M e (x + (n - 2)iu)\\ + 0(1) 

n— 3 n—2 

= log \\M 2i (x + jiuj + iy)\\ - J2 lQ g \\Mt(x + jtu) + iy)\\ + 0{£) . 

3=0 3=1 

Combining the elementary almost invariance property 

log\\M N {x + iy)\\=l- 1 Y, log\\M N (x + jto + iy)\\ + 0(£) 
o<j<e-i 
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(5.22) 



with H5.21[) yields 

log\\M N (x + iy)\\ =t~ 1 log \\ M 2e (x + jiu + iy)\\ 

0<j<N 

-r 1 ]T log\\M e (x+jLj + iy)\\+0(£) , 

0<j<N 

for any x e [0, 1] \ B' y , where mes B' y < N~ 9 . Integrating (|5.22(l over x shows that 
(5.23) L N {y, E) = 2L 2e (y, E) - L e (y, E) + 0(£/N) . 

This identity is formula (5.3) in [GolSchi] (with y = 0). Since the Lyapunov exponents are Lipschitz in y, 
the sub-mean value property of subharmonic functions on the disk 7?(x,0;5) with S — N^ 1 in conjunction 
with H5.22J) and (|5.23(l implies that, for every i£T, 



\og\\M N (x)\\ 



log\\ M N (0\\d£ 



(5.24) 



T>(x,0;5) 

i 



u (£ + fa + «7) - N ( u (- + ir i)) 

.0<j<N 

v(£ + ju> + ii]) ~ N(v(- + in)} 

0<j<N 



where 44- 



denotes the average over the disk, 



<i£ drj 

(%dr) + 0(l) , 



□ 



ufc + irf) :=* -1 log||M M (f + and v(£ + irf) := t~ l log \\M t (S + irj)\\ , 

and (•) denotes averages over the real line. The lemma now follows easily from (|5.24|) . 

The first application of this estimate is as follows: 
Lemma 5.3. Let to satisfy y2.ll)) . Then for any xq € T, Eq G R one has 

(5.25) #{EeR:f N (e(x Q ),w,E)=0, \E - E \ < exp(-(logiV) A ) } < (logiV)^ 

(5.26) #{zeC:W^,£„) = 0, \z - e(x )\ < N' 1 } < (log N) A > 
for all sufficiently large N . 
Proof. It follows from Lemma l5.1l that 

sup{log\f N (e(x),(j,E) \ :xeT, E e C, \E - E x \ < exp(-(log iV) A ) } 

< NL n (uj,E{) + {\ogN) B 

for any E\. Due to the large deviation theorem, there exist x\,E\ such that \x$ — x\\ < cxp(— (logiV) 
\Eq - Ex] < exp(-(logiV) 2A ) so that 

loelfjfiefjd^u.Ej)] > NL n (lu, E,)- (log N) 4A . 
Due to Jensen's well-known formula, see (|5.71|) below, 

# {E : f N (e(xi),u,E) = 0, \E - E 1 \ < exp(-(logiV) A ) } < (\ogN) c . 
Since ||iJ^'-'(xo,a;) — Hfc'(xi, u>)\\ < exp(— (log TV) 2 " 4 ) and since (xo,w) is self adjoint one has 
#(E:f N (e(x ),uj,E) = 0, \E - E \ < exp(- logiV) 2A ) } 
< #{E:f N (e(xi),u,E) = 0, |£ - ^| < cxp(-(log A0 A ) } < (logiV) C . 
That proves (|5.25|) . The proof of l|5.26|) similar. □ 
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These estimates of the local number of zeros of the determinants fa allows one to factorize /jv in each 
neighborhood of size exp(— (logJV)" 41 ) using the Weierstrass preparation theorem, with a polynomial factor 
of degree at most (log N) c . For example, the following is proved in GolSch2 , see Section 6. 

Proposition 5.4. Given zq € A po /2, Eq G C, and luq as in (|2.11|l . there exist a polynomial 

P N (z, u>, E) = z k + a k ^(u, E)z k - 1 + ■ ■ ■ + a {E, to) 

with aj(u,E) analytic in T>(Eo,ri) x T>(uo,ri), r% x exp (— (\ogN) Al ) and an analytic function 

g N {z,oj,E), (z,u,E) eV = V(z ,r ) x V(E ,n) x V(u ,n) 

with rg x N^ 1 such that: 

(a) fa(z,w,E) = P N (z,uj,E)g N (z,uj,E) 

(b) 9n(z, oj, E) ^ for any (z, w,E) eP 

(c) For any (u>,E) £ T>(u>o,ri) x T>(Eo,ri), the polynomial Pn(-,uj, E) has no zeros in C\2?(zo,ro) 

(d) k = deg P n (-,lj,E) < {logN) A . 

Another application of the uniform upper estimates is the following analogue of Wegner's estimate from 
the random case (see [Weg ). It will be important that there is only a loss of (\ogN) A in 1(5.27(1 . 

Lemma 5.5. Suppose ui satisfies (12.11(1 . Then for any N 1, E s R, H > (log7V) A one has 

(5.27) mes {ieT: dist(sp H N (x,u),E) < exp(-iJ)} < cxp(-i//(logiV) A ) . 

Moreover, the set on the left-hand side is contained in the union of < N intervals each of which does not 
exceed the bound stated in \5.21\j in length. 

Proof. By Cramer's rule 

(H N (x,u)-E)~ l (k,m) 

By Lemma 5.2 

log | f [lM (e(x),uj, E) | + log |/ [m+ i,jv] (e(a), w, E) \ < NL{u, E) + (log N) A ' 
for any i£T. Therefore, 

T2 e^p{NL(u J ,E) + (log N) A ) 



\f[l,k]( e ( x )> u ' E )\ 


\f[m+l,N] 


(e(x),u,E)\ 




fa(e(x),w,E) 





\(H N (x,u) - E) || < iV 2 



\fa(e{x),u,E)\ 



for any x £ T. Since 

dist(sp(#iv(a;,u;),£) = \\(H N (x,u) - Ey 1 ]]' 1 , 
the lemma follows. □ 

5.3. Elimination of resonances and the separation of zeros. Given arbitrary E € M, the typical 
distance from E to the eigenvalues of equation 1(1. 1 f> on a finite interval [-N, N] should be at least const -N^ 1 . 
If for some E € R and x £ T this distance p is considerably smaller 9 than iV _1 , then we say that (E, x) are 
in resonance and we refer to p _1 as the magnitude of the resonance. Clearly, the x-averaged distribution of 
the eigenvalues of ((1.1(1 on the interval [— JV, JV] controls the probability of resonances. For more accurate 
estimates the fine properties of this distribution are very important. 

Assume that for some E S R, x £ T, n E Z, both (E, x) and (E, T n x) are in resonance. In this case we 
say that this pair forms a double resonance. Double resonances play a crucial role in any proof of Anderson 
localization, i.e., that the eigenfunctions of p. 1(1 decay exponentially as |n| — > oo. This was found in Sinai's 
classical work |Sinl| . where Anderson localization was established for ((1.1|) with V(x) = cos(27rx) and large 
|A|. A novel, non-perturbative approach to the study of double resonances was found by Bourgain and the 
first author in |BouGol| . It is based on the following notion: 

Definition 5.6. A set S C R™ 1 is called semi-algebraic if it is a finite union of sets defined by a finite 
number of polynomial inequalities. 



^Technically speaking, this means exp (—TV 6 ) with b < 1 
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For instance, let V(x), x G T be a trigonometric polynomial and consider the dynamics of the shift on T. 
Then for fixed x the double resonances can be included into a semialgebraic set (in the to, E plane). An 
important parameter of a semialgebraic set is its degree which equals 

(number of polynomials involved) x (maximal degree of these polynomials) 

If V is a trigonometric polynomial, then the degree of the set of resonances on the interval [-N, N] is at most 
N c for some absolute constant C . On the other hand, the only "dangerous resonances" for the localization 
are those of magnitude exp(ciV). That allows one to eliminate double resonances for the case of the shift 
x — > x + to and skew-shifts (x, y) — » (x + u>, y + x) using simple geometrical ideas related to semialgebraic sets, 
see |Bou2| . Section 9. In GolS ch2j we develop a more quantitative method for analyzing the resonances, 
which is based on the theory of resultants and discriminants of polynomials. The polynomials in question 
are those which arise in the factorization of the determinants /n via the Weierstrass preparation theorem in 
the phase variable x. This method turns the set of resonances into a set on which some analytic function (in 
the case of a double resonance it is the resultant) attains very small values. Cartan's estimate from above 
applied to this analytic function then leads to bounds on the measure and complexity of this set. The logic 
of this is captured by the following lemma from Section 7 in ( iolSch2|. We also use the following notation: 
Given w = (wi >0 , . . .,w d ,o) G C d , r = (n, . . . ,r d ), r< > 0, i = 1, 2, . . . , d, set 

Sw , L {wi, ...,vid)= (r^iwi - wi. ), . . . ,r d ~ 1 (w d - u>d,o))- 

Lemma 5.7. Let P s (z, w) = z ka + a Sy k s -i(w)z ks ~ 1 + ■ ■ ■+a s .o(w), z G C, where a s j(w) o,re analytic functions 
defined in some polydisk V = Y[ D(wi_o, r), w = (wi, . . . , Wd) € C d , w = (u>i,ci; ■ ■ • j Wd.o) G C d , s — 1,2. 

i 

Assume that k s > 0, s = 1, 2 and set k = fcife- Suppose that for any w_&V the zeros of P s (-,w) belong to the 
same disk D(zq, ro), ro <C 1, s = 1, 2. Let t > 16fcro r" 1 . Given H 3> 1 there exists a set Bh C V such that 

d 

Sw^.^lQkrot- 1 ,r,...,r) 

(B H ) G C&v d (H 1 / d ,K),K = CHk and for any we V(w lfi ,8kr /t)x ]J V(w ]fi ,r/2)\B H 

i=2 

one has 

(5.31) dist ({zeros of Pi(-,w)}, {zeros ofP 2 (-+t(w 1 -wi, ),w)}) > e-° Hk . 

It is instructive for the reader to first consider the meaning of the previous lemma for the case where 
neither P\ nor P2 depend on w. Lemma 15.71 is the principal tool for eliminating resonant phases and energies 
in the paper IGolSch2*| . As a typical application of it we mention the following lemma on the separation of 
the zeros of determinants from Section 8 of GolSch2 . Let T(x) =i+wbea shift and fi>f(z, oj, E) be the 
Dirichlet determinants defined as in l|5.11J) with v(n,x) = V(x + nuo). Also, Z(f,zo,ro) denotes the set of 
zeros of / in the disk T>(zq, ro). 

Lemma 5.8. Let C\ > 1 be an arbitrary constant. Given li > I2 ^> 1, t > exp((log£i)" 4 ), H ^> 1, there 
exists a cover o/T x T CjQ x [— C\, C\] by a system S of polydisks 

V{x m ,r) x V{iu m ,rt- 1 ) x V{E m ,r), x m G T, E m G [-C U C X ] , 

with UJ m G T c a , and r = exp(— (log^i)' 42 ), and which satisfies the following properties: S has multiplicity 
< 1, cardinality #(5) < t exp((log^i)' 41 ) and for each m, there exists a subset £lf. 1 .i 2 .t,H,m C T>(bj m , rt^ 1 /2) 
with 

5„ mi rt-va(^ 1 A,t,ff,m) G CaniH 1 / 2 ^), K = (log^) 5 
such that for any lu G T>(u> m , rt^ 1 /2) \ £li lt i 2 ,t,H.m there exists a subset 

) G Can(i7 1/2 ,X) 

such that for any E G T>(E m , r) \ £e 1: £ 2: t,H,uj,m. one has 

dist(^Z(f il (.,u ) ,E),e(x m ),r),Z(f e2 (.e(tu>),u;,E),e(x m ),r)) > e - H ^^ c . 
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5.4. Localization. Let fjy(z,uj,E) be the Dirichlet determinants for equation on the interval [1,N] 
with the dynamics Tx = x + w. The separation of the zeros of /at(-, u>, E) and /n(x + nuj, u>, E) described in 
Lemma l5~Bl is achieved due to the dynamics z — > ze(cu). It is shown in GolSch2 that this separation property 
implies that for most energies the associated eigenfunctions on a finite interval [-N, N] are exponentially 
localized. Furthermore, the authors also show that this localization on a finite interval can be used to obtain 
a lower bound on the minimal distance between a typical pair of eigenvalues, i.e., between the zeros of 
E i— > fff(x,u>,E). This application of Anderson localization will be described in the following subsection. 
Here we outline how one can find the localized eigenfunctions. 
Any solution of the equation 

(5.41) -ip(n + 1) - i/}(n - 1) + v(n)tp(n) = Eip(n) , n G Z , 
obeys the relation 

(5.42) ip(m) = G [aM [E)(m, a - l)^(o - 1) + Q [aJb] {E)(m, b + 1)^(6 +1), me [a, b] 

where G[ a ,b] (E) = (H[a,b\ ~ E) * is the Green function, Hum being the linear operator defined by l|5.41|l for 
n G [a, b] with zero boundary conditions. 

In view of Q5.42|l . one can prove exponential localization on [1,N] by showing that outside of some 
subinterval [no — L,n^ + L] (where logL <C logJV) all Green functions of a much smaller scale (say, scale 
(logN) c ) have exponential off-diagonal decay. Indeed, (|5.42|l would then imply that any eigenfunction 
is exponentially small outside of the window [no — L, no + L]. This strategy was introduced by Frohlich 
and Spencer in their fundamental work (see |FroSpel] , |FroSpe2] , |FroSpeWit| ) on Anderson's model. The 
following simple lemma shows that the question of exponential off-diagonal decay for the Green function is 
intimately related to the LDTs for the determinants. 

Lemma 5.9. Let w € T 0i0 . Suppose L(ui,Eq) = 7 > 0, 

(5.43) log\fi(z ,L0,E )\ >£L(u,E )-K/2 
for some z Q = e(x ), x G T, E G R, I > 1, K > (log£) A . Then 

\G llA (zo,u,E)(j,k)\ <expf-J(fc-i) + X 



2 

||^[i^](2o, u,E)\\ < exp(K) 
where Gn t e\ (-^o, oj, E ) = (Hun (z , oj) — Eq) 1 is the Green's function, 1 < j < k < I. 

Proof. By Cramer's rule applied to (Hun — E)^ 1 , the uniform upper bound of Lemma |5. 21 as well as the 
rate of convergence estimate \L — Li\ < l~ x from [Gol Schl , 

\G[i,e\(zo,u,E)(j,k)\ = \fj-i(z ,uj,E)\ ■ \fi- k (zoe(ku),u,E )\ ■ \f e (z ,uj, E )\ 1 

< |/,(z ,^,^o)r 1 exp(^(a>,So) - (k-j)L(w,E ) + (logif), 

and the lemma follows. □ 



(5.44) 



The idea behind implementing the aforementioned Frohlich-Spencer scheme is now as follows: First, if ip 
is an £ 2 -normalized eigenfunction of •ffri isn (^o, w ) with eigenvalue E, then (|5.43|) must fail with £ = (logiV) c 
for some zq — e(xo + k^uj). In other words, it must fail for some determinant f[k ,k +i]{ x o, w, E). Second, 
if it were to fail for another determinant f^^i+i] ( x o, w, E) where \ko — k\\ = t for a sufficiently large t, 
then this would lead to a contradiction of the separation of zeros property described above. Note that the 
latter dictates the size of t (here it turns out to be t = exp((loglog N) c ) ) and therefore also the size of the 
localization window. Moreover, note that we are forced to eliminate a set of energies and lo to achieve this 
separation of the zeros. Hence, we can only hope to obtain the localization property of the eigenfunctions 
if the energy falls outside a set of exceptional energies. We conclude that outside of some window all 
determinants of a smaller scale satisfy l|5.43f) . and thus the eigenfunction tp has to be exponentially small 
there (due to an application of the avalanche principle and H5.420 )• For further details we refer the reader 
to Section 10 of |GolSch2| . 
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5.5. Distances between eigenvalues on a finite interval. Dirichlet eigenvalues on a finite interval are 
simple. However, the eigenvalues can be as close as exp(— cN), where N is the size of the interval. It follows 
for instance from the analysis of in Sinai's work |Sinl] that this is the case if x belongs to some subset 
of T of measure exp(— *yN). In |GolSch2] the localized eigenfunctions were used to improve upon the e~ cN 
bound. Indeed, the "typical' distance between the eigenvalues turns out to be e~ N . 

It will be convenient for us to work with the operators H\_j^m(x,Lo) instead of -ff[i,./v] (x, u>) as we did in 

the previous section. We use the symbols Ej\ ipj N ^ to denote the eigenvalues and normalized eigenfunctions 
of iJ[_jv.AT](^! w )- We denote the sets of lo and energy, which we needed to remove in the previous section, 
by Sljv, £n,u- They are of measure < exp(— (logiV)^ 2 ) and complexity < exp((logiV) j41 ) where A2 ^> A\. 

Lemma 5.10. For any lu € T CiQ \ f2jv and all x one has for all j, k and any small 5 > 
(5.51) \EW(x,w)-El N) (x,w)\>e- ff ' 
provided Ej N \x,lu) ^ £n,u> and N > No(S). 

Proof. Fix x € T, Ej N \x,lu) ^ Sn,u>- Let Q X exp((loglog N) c ) . By the Anderson localization property 
(see the previous subsection as well as Section 9 in GolSch2j) there exists 

A Q := [vf\x,uj)-Q, ^ N) (x,lu) + q] n [~N,N] 

so that 



E 

ne[-N,N]\AQ 



I f[-N, n ] (e(aO, w ; E f ] (x, lu)) I 



N 
n=-N 



Here we used that with some fj, = const 

ipj N \x,u;n) = /j, ■ f [ _ N ^ 1] (e(x),Lu;E i j N \x,u)) 
for — N < n < N and the convention that 

J[-N,-N-l] = , f[-N,-N] = 1 • 

One can assume Vj^ {x, lu) > by symmetry. It follows from the avalanche principle (recall that all deter- 
minants of scale (\ogN) c which fall outside of the window of localization are non-resonant) and a simple 
stability bound in the energy that 

I f[-N,n] (e(x), to; E) - f[-N,n] (e(x) , w; #j (x, uj) \ 2 



-N 



<e-**\E-E$*{x, U )\ 2 eW V |/ hA , n] (e(*), W ;if > (*,«)) 



Let m — Vj (x, u>) — Q — 1. Furthermore, we bound the difference on the window of localization simply by 

/ /[-A',n+i](e(a;),w,£ , )\ / f[_ Ntn+ i](e(x),L0, Ef> (x,u)) 
K f { - NM (e(x),u,E) ) y f [ _ Ntn] {e{x),^,Ef l \x,Lj)) 

M (t\ ,(f[-N,n+iMx),u,E)\ (N) // [ _ JV , n+1] (e(x),w,£ i W ) 

M [ni+lin] (e(a;),a;,£;) - M [ni+1>n] (e(x), w, E) (s,w)J 

V /[-JV,n](e(a:),w,£?) / ^ f { _ NM (e{x),uj,E) N) ) 

1/2 

< e C(n-n l)e - 7 Q|^_ #)(, )W )| e (logN) c ' 



( E |/[-iV,„](e( : r),a;,sf ) (x,a;))| 2> ) 

VneAn / 



16 MICHAEL GOLDSTEIN AND WILHELM SCHLAG 

Now suppose there is E^ N \x,lo) with \E^ N ^ (e(x), ui) — E^ N \x,uj)\ < e~ N& for some small 5 > 0. Then, by 
the preceding, 

N 

E \f[-ff,n](e(x),w;Ef\x,u)) - f[_ Nin ](e(x),w",Ef'(x,w))\ 

n=~N 

<^ hNS E \f { -N M {e{x),u;Ef\x,u))\\ 

neA Q 

provided N s > exp((log log N) A ) . That contradicts the orthogonality of the eigenfunctions. □ 

5.6. Simplicity of the zeros of fiy(-,u>,E). An estimate for the minimal distance between the zeros of 
fiv(-,u},E) is crucial for the analysis of the IDS in |GolSch2] . In contrast with the eigenvalues of Hm(x,uj), 
x £ T, the real zeros of the discriminant /jv( - , w, E), E £l, can be degenerate. However, that happens only 
for special values of the spectral parameter E. This follows from the simplicity of the zeros of fw(x, u>, ■) by 
means of Sard-type arguments. To turn this statement into a quantitative estimate one has to make use of 
the estimate for the minimal distance between the Dirichlet eigenvalues from the previous subsection. The 
following general assertion, which is a combination of Sard's theorem and Cartan's estimate for analytic 
functions, allows one to do that. 

Lemma 5.11. Let f(z,w) be an analytic function defined in T>(0, 1) x T>(0, 1). Assume that one has the 
following representations: 

(i) f(z, w) = (w — bo(z))x{z, w), for any z £ 2?(0, ro), w £ T>(0, r±), where bo(z) is analytic in T>(0, ro), 
sup|6o(z)| < 1, x( z > w ) * s analytic and non-vanishing onD(0, ro) x T>(0, r{), where < ro,r% < ^ 

(ii) f(z, w) — P(z, w)9(z, w), for any z £ T>(0, ro), w £ £>(0, r%) where 

P(z, w) = z k + Ck-i(w)z k ~ 1 H hco(a)) , 

Cj(w) are analytic in 2?(0,ro), and 6(z,w) is analytic and non-vanishing on 2?(0,ro) x 2?(0, ri), and 
all the zeros of P{z,w) belong to 2?(0, 1/2). 
Then given H ^> k 2 log[(rori) _1 ] one can find a set Sh C T>(u>q, r%) with the property that 

mes (Sh) r l exp (—cH/k 2 log[(rori) -1 ]) , and compl(S'ff) < k 2 log[(rori) _1 ] 

such that for any w £ L)(0, ri/2) \ Sh and z £ T>(0, ro) for which w = bo(z) one has 

\b' (z)\ > e- kH 2- k ri . 

Moreover, for those w the distance between any two zeros of P(-,w) exceeds e~ H . 

Proof. Assume that k > 2 and set ip(w) — discP(-, w). If k = 1, then skip to (|5.66() . Then ^f(w) is analytic 
in 2?(0,ri). Assume that (^(if)! < r for some r > 0, w £ X?(0,ri). Recall that for any tu 

(5.61) ^(w) = H((i(w)-( j (w)) , 

where Q(w), i — 1,2,..., k are the zeros of P(-, w). Then \C,i(w) ~ Cj( w )\ < t 2 /*^ -1 ) for some i ^ j. Set 
Ci = Ci( u '): Cj = Cj( w )- Assume first Ci 7^ Cj- Then 

/(Ci,»») = o f(Q,w) = o, o < 10 - 01 < r 2 /^- 1 ) . 

Due to (i) one has w = bo(Ct) = &o(Cj)- Hence, 

(5-62) |^(C0| < ||C, - 01 max \%(z)\ < \Q - Q\r» 2 < r^ 2 r 2 ' k ^ . 

If = Cj tnen ^(Ci) w ) — 0, d z P(Q,w) = 0. Then f(d,w) = 0, d z f(Q,w) = due to the representation (ii). 
Then u> — bo(Ci) — 0; &'(Ci) — due to the representation (i). Thus i|5.62[l holds at any event. If ip(z) is 
analytic function in T>(0, r), then it follows from the general change of variables formula that 

mes {w : w = <p(z), z £ £>(0, r), |<^'(z)| < n} < irr 2 rj 2 

In view of the preceding one obtains 

(5.63) mes {w£V(0,n) : \ip(w)\ < r} < r„ 2 r 2/k( * -1) . 
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On the other hand, due to l|5.61[l one obtains 

sup{\i/j(w)\ : w G T>(0,n)} < 1. 
Take r <C (r ri) fe (' c ~ 1 )/ 2 . Then one obtains from l|5.63() that 

|-0(w)| > r 

for some \w\ < ^j. By Cartan's estimate there exists a set Th C V (0, ^) with 

mes T H <r\ exp (-cH/k 2 log[(r ri) -1 ]) 

and of complexity < k 2 log^rori)" 1 ] such that 

(5.64) log |^(w)| > -H 

for any w G V(0, ^)\ T H . 

In particular, (|5.64() implies that 



(5.65) \d(w) - Q(w)\ > e~ H 

for any w G T>(0, ^) \ 7#, i ^ j. Take arbitrary u>o such that dist(u>o, Th) > 2e~ H , wq — &o( z o) f° r some 
z G f(0,r o ). Then 

\P(z, w )\ > (2e H y k for all \z - z \ = e~ H /2 
by the separation of the zeros H5.65(l . By our assumption on the zeros of P(z, u>), 



sup sup \d w P{z, w)\ < r 1 . 

zGT>(0,r o ) 10625(0, ri) 



Thus, 



|P(z,w)l > V fe e- fcH if |z-z | -e- H /2, |w - w \ « 2" V' " / ,. 



Then due to the Weierstrass preparation theorem, 

(5.66) P(z,w) = (z-C(w))X(z,w) 

for any z G T)(zo,r' ), w G 2?(u;o,r5J, where Tq = e~ H /2, r[ <C e~ kH 2~ k ri, and C(^) is an analytic function 
in T>{wQ,r' 1 ) 1 \(z,w) is analytic and non-vanishing on T>(zo,r' ) x T>(wo,r' ). Comparing the representation 
(i) and Ij5.66|l one obtains 



(5.67) 



\w-b (z)=0 iff 
\z-((w) =0 

for any z G T>(zo,r' ), w G ^(wo, r 'i)- ^ follows from (|5.67l) that 

K(CM)| > ICHT 1 >ri > e -^2- fe n, 

as claimed. □ 

5.7. Harnack's inequality and Jensen's formula for the logarithm of the norms of monodromy 
matrices. The logarithm of the norm of an analytic matrix-function is a subharmonic function. Harnack's 
estimate in this context is not as sharp as for the logarithm of the modulus of an analytic function. The same 
comment applies to Jensen's averages. The latter here refers to the following: the Jensen formula states that 
for any function / analytic on a neighborhood of T)(zq, R), see |Levj . 



(5.71) 



[ log\f(z o + Re(9))\d0-log\f(z o )\ = J2 l °Z77^ 



0/(0=o 



R_ 

zol 



provided f(zo) ^ 0. We showed above how to combine this fact with the large deviation theorem and the 
uniform upper bounds to bound the number of zeros of /n which fall into small disks, in both the z and E 
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variables. In what follows, we will refine this approach further. For this purpose, it will be convenient to 
average over zq in (|5.71|) . Henceforth, we shall use the notation 

(5.72) Vf[zQ,r) = #{z e V(z ,r) : f(z) = 0} 

(5.73) J(u, z , r\, r 2 ) = j dxdy J d£drj[u(() - u{z)]. 

•D(z ,ri) T>(z,r 2 ) 

The following simple lemma is proved in GolSch2 . It is our main tool for counting zeros. 
Lemma 5.12. Let f(z) be analytic in T>(zo, Rq). Then for any < r 2 < r\ < Rq — r 2 

Vf(za,n -r 2 ) < J {\og\ f\,z ,r x ,r 2 ) < v f (z ,n + r 2 ) 

We now describe how the aforementioned technical issues were addressed in Section 12 of |GolSch2] for the 
transfer matrices of the Schrodinger co-cycles. More precisely, we state the two main results of that section. 
The reader should not be distracted by technicalities, but rather notice how the norms of the matrices mimic 
the behavior of the entries. For the latter the crucial piece of information is the number of zeros in various 
disks. In that respect, we emphasize the quadratic estimate in l|5.74|l . The linear estimate (i.e., the one 
where the scalar logarithm is not subtracted) would be too weak for the study of the IDS in IGolSch2'] . 

Proposition 5.13. (i) Suppose that one of the Dirichlet determinants 

f[l,N]{-,U,E), f \ 1>N _!](•, U,E), f [2iN ](-,U>,E), f [2 ,N-i](-,^,E) 

has no zeros in T>(zQ,r\), exp(— y/N) < r% < cxp(— (log iV) c ) . Then 

(5-74) lQ g \\^ N f Z ' U ' till ~ lQ S 1 1 + fl °( z ~ z °) I <l z -^o|V 2 - 2 

\\M n {z ,uj,E)\\ 

for any z S T>{zq,T2), r 2 = ri exp(— (log N) 2C ) , and with \cto\ < r^ 1 . 
(ii) Assume that the following conditions are valid 

(a) each of the determinants /r a jy_y(-, w, .E), a — 1,2; b = 0, 1 has at least one zero in P(£o,Po), 
where < p < exp(-(log 7V) Bo ) 

(b) no determinant f[ a ,N-b]('i w i E) has a zero in 2?((o,Pi) \ 'DiCo, Po) , Pi > e x p((l°g ^) Bl )po, 
B ^>B 1 +A. 

Let k = minZ(/ [a N _ b] (-,uj, E),( Q , p ). Then for any 

z,C e V(C ,p' 1 )\V(C ,p 2 ), p[ = exp(-(logiV) B2 )p 1 , p 2 = exp((logiV) B2 )p , B 1 » B 2 » 1 
one has 

Ml - , I 



\\M(z)\\ °\z-(o\ 



< exp(-(logiV) ( 



Proposition 5.14. (i) Assume that one of the Dirichlet determinants f\ a ^-u(-,u),E), a = 1,2, b = 
0,1 has no zeros in T>(zo,ri), exp(— V~N) < n < exp(— (log A^)*^ 1 ) . Then 

(5.75) ^J(\ag\\M N (;u:,E)\\,z Q , Pl ,p 2 ) < p^f 2 exp((logA^) s ) 

P2 

for any r\ exp(— ^/N) < Pi < r\ exp(— (log N) A ^j , p 2 = cp\ 
(ii) Assume that for some t$ the following conditions are valid 

(a) each of the determinants f[ a ^^{-,uj,E), a — 1,2; b = 0, 1 has at least one zero in 2?((o,po), 
exp(— </N) < po < exp(-(logA r ) Bo ). 

(b) no determinant /[ a .jv-b](', w , E) has a zero in T>(C,o,Pi) \ T> ((o,po), p\ > exp((log A r ) Sl )po, 
B a >B 1 . 
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Let k = min#Z(/[ a . A ,_ b] (-,w,£ , ),Co,Po)- Then for any 

a.b 

zi GP(Co,p'i)\^(Co,p 2 ), p\ = e* V (-{\ogN) B *)p u P2 xexp((logiV) B2 )po, 
B\ > B2, one has 

r 2 

(log\\M N (;co,E)\\,z 1 ,r 1 ,r2) ~ k < exp(-(logiV ) c ) 
where \zi — Co|(l + 2c) < r\ < p' x , r 2 = CT\, and < c <C 1 is some constant. 

6. The IDS: Lipschitz, Holder, and absolute continuity 

We now sketch the main steps that allow us in |ColSch2 | to pass from information about the zeros of 
/jv(-z,w, E) in z and E to information on the IDS. 

6.1. Concatenation terms and the number of eigenvalues falling into an interval. Consider the 
following expressions which we call concatenation terms in view of their role in the avalanche principle 
expansion: 

\\ M [i,k] E + if}) II ||-W[fe+i,iV] (e(x), uj,E + irj) \\ 



(6.11) W N<k (e(x),E + iri 

1 < k < N, where u> is fixed. 
Lemma 6.1. Let x e T, E e K, 77 > ; and let 
(6.12) 



\M[ ltN] (e(x),uj,E + in) 



|/[a,JV-6+i] (e(x), uj,E + ii])\= max |/[ a ',jv-6'+i] (e(x), uj,E + irj) 

Ka'b'<2 



for some 1 < a, b < 2. Then 

(6.13) #(sp H [aiN _ b+1] (e{x),uj)n(E-r i ,E + r 1 )) < ir, £ W N , k (e(x), E + irj) 



Kk<N 



Proof. By Cramer's rule 

(6.14) (H [a . N , ] {e{x),u)-E-n 1 y 1 {k,k) = 



f[ a ,k] (e(x), uj,E + irj) f[k+2,N'] (e{x), uj,E + irj) 
f[ a ,N>] (e(x),ui,E + irj) 



f[a,N'] (e(ar), u,E + irj) -f[ a +i,N'] ( e ( x )> u,E + irj) 
f[ a ,N'-i] {e(x),u, E + irj) -f[ a +i,N'-X\ (e(x), uj,E + irj) 



(6.15) M [a<N > ] (e(x),LJ,E + iri) = 
Due to (|6~l!f 

(6.16) \\M N (e(x),uj,E + irj)\\ < 2 /[ Q: Ar_ h+ i] (e(x), w, E + in) 
Combining l)6.14[l . 16.15|l . (|6.16(l one obtains 



tr 



K-ff[o,jv-6+i](a;,w) -E-i-nj J 



(6.17) 



a<k<N-b+l 



f[ a ,k] (e(x),w,E + irj) 


f\k+2,N-b+i] (e(x), u, E + irj) 




f[ a ,N-b+i] (e{x),w,E + irj) 





< 2W N ,kW x )> E + i v) 

a<k<N-b+l 



On the other hand, 



and we arc done. 



tr (H[ atN _ b+1 ](x,w) - E- irj) 1 

> (2tj)- 1 # (sp H [a , N _ b+1] (e(x), oj)n(E-ri,E + n)) 



□ 
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Recall that 

(6.18) (H(e(x),w) -E-ir/y 1 (k,k) < (H (e(x), u) - E - irj)^ 1 

Corollary 6.2. Using the notations of Lemma \6.1\ one has 

# (sp (H [1)N] (e(x),Lu)) n (E - T), E + rj)) 
<A V ^2 W N , k (e(x),E + ir,)+#(K) + 2 

Proof. Due to Weyl's Comparison Lemma, see |Bha| . 

# (sp (H [1>N] (e(x), u)) n (E - rj, E + n)) < 

# (sp (ff [o ,jv- 6 +i] (e(x), u))n(E-ri,E + Tj))+2 
Therefore, the assertion follows from the previous lemma. 

Lemma 6.3. Let A be an n x n Hermitian matrix. Let "I^ 2 ), . . . , <!<(") g C" be an orthonormal basis 
of eigenvectors of A and E^\ E^ 2 \ . . . , E^ be the corresponding eigenvalues. Then for any E + irj, E € M, 
rj > one has 

2 



□ 



E 10 



-E-irj) e k ,e k 



> E E 0*.* w ) 

l<fe<n l<j<n \l<fc<n 

where ei,e%, ■ ■ • ,e n is arbitrary orthonormal basis in C™. 
Proof. One has 



• Im 



— E — irj 



({A-E-irj)- 1 e k ,e k )= ]T |(e fc ,¥«>) 

l<j<n 



- £ - 



A-E-tr?) e fc ,e fc J > Im Hi — E — irj) e k ,e kj 

= E |( e fc»* W ) 2 Im(-B (i) -E-irj)' 1 . 
l<j<n 

Since Im (_E^' — E — irj) > 0, J = 1, 2, . . . ,n, the assertion follows (use . a 3 ) 2 > . a 2 if dj > 0). □ 

In the next corollary we show how to use effectively the localized eigenfunction to sharpen the estimate 
on the number of eigenvalues falling into an interval in an abstract setting. 

Corollary 6.4. Using the notations of the previous lemma assume that the following condition is valid for 
some E, rj: 

(L) for each eigenvector tyv) with \E^ — E\ < rj there exists a set S(j) C {1,2, . . . , n}, jf=S(j) < £ such 
that £ |(e fc ,*W)| 2 <l/2. 



Then 



E ij) - E 



< rj] < 8V \(( A ~ E 



Kfc<r 



Proof. Recall that for any positive oji, . . . , otf, with ^2 a j = 1 

i 

by Cauchy-Schwarz. Due to the assumptions of the corollary 

2 



1= = ^ |(e fe) *«)| < |( efe >^ 



Kk<n 



keS(j) 



1/2 
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for any — E\ < r\. Hence, for such we have 

(6.19) |( e fc>* 0) ) 4 > l/4€ 

kes(j) 

and the assertion follows from the previous lemma. 

Consider the following expressions which we also call concatenation terms: 

\\M 2m (z,uj,E)\\ 



□ 



W m (z) = - log W m {z) = log 



\M m (ze(mLo),uj : E)\\ \\M m (z,w,E)\ 



These terms are simple linear combinations of subharmonic functions. Hence, the Riesz representation 
theorem allows one to write them in the usual way, albeit with a signed measure rather than a positive 
one. Nevertheless, it turns out that this measure is an "almost" positive measure. This feature of the 
concatenation terms combined with the avalanche principle expansion allows one to establish a very sharp 
relation between these terms for different scales. Assume that the following condition holds: 

(I) no determinant f[ a ^ m -b][-e(nmu}),uj, E) , /[ a ,2m-(i]( , ! w ,-B), a = 1, 2; b = 0,1; n = 0,1, has a zero in 
some annulus 2?(Co,/?i) \ T>((o,po), where po x exp(— m s ), po < p\ < exp (— (logm)" 4 ), < 6 <C 1. 



Set 



n = 0,1, 



k n = min v 



nm v i \ 

a,b f[a, m -b] {■e(nmu),uj,E) 



(Co,Po) , 



k — min v 



fla,2m-b](;0J,E) 



(Co, A)) 



Lemma 6.5. Assume that pi > pQ° gm ^ . Then 



k + fci < k < k + ki + fa (A, V) 
provided Bq^>1. Here fci(A, V) is some integer constant. 

Proof. Recall that due to the large deviation theorem there exists z = e(x + iy) € 2?(£o,pi) \ £>(Co, p'i/2) 
such that 

| log ||-M[i !?Tl ] (ze(nm,Ld), ui, E^ || — mL(y, < to* (logTO.) 2 , n = 0, 1, 

\log\\M [h2m] (z,u,E)\\ -2mL(y,E)\ <m 5 (\og m y B2 
with 1 <C B2 < Bq. Combining these relations with Proposition 15 .131 one obtains 

\M lhm] (£e(mw), w, E) || || M m (C, u, E)\\ 



log- 



\M [1} 2m]((,U,E)\ 



(k + k x -k) log 



IC-Col 



I* -Col 



< Cm 5 (log m) 



Bi 



for any C, € P(Co,p'i) \ £>(Co,P2)- Since \z — Col X Pi, Pi X exp (— m <5 (logm) B ° — (logTO,) Bl ), one can pick 
C € £>(Co, Pi) \ X»(Co, P2) such that |C - Col = \z - ^ | ^ m >"° /2 . Then 

\M { i, m] (Ce(muj),uj,E)\\ \\M m (C,u,E)\\ 



(6.110) 

Recall that 
(6.111) 



log- 



\M 2m ((,u;,E)\ 



m s (logm) B «/ 2 (ko + h-k) 



< Cm s (\ogm) Bl . 



|M 2ro (C,w,£)|| < ||Af m (Ce(mw),w,^)|| \\M m (C,u,E)\ 
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Relations (|6.110|) . (|6.111|) imply k + k\ — k < 0. Removing the absolute values in (|6.110|) and (|6.111|) 

and taking Jensen's averages one obtains the following: 



44 J log 



\M 2m (;0J,E)\ 



\M[i tm ](-e(mw),u,E) \\ \\M m (-, u, E)\ 



, Co,n,r 2 ) - (k-ki- k 2 ) 



< 2 



where r\ = p'i/2, r 2 = cr\. Hence, k — k\ — k 2 < k\(\, V). 



□ 



J j,m 



The integers defined in the previous lemma almost perfectly substitute the measure representing the 
concatenation term. More precisely, the following assertion holds: 

Proposition 6.6. Given E 6 C and integer m 3> 1, there exists a cover of A po / 2 by disks T> {Cj,m, Pj,m)> 
Cj,m € A po / 2 , j = l,2,... , j rn such that the following conditions are valid: 

(1) exp(-m 5 ) < pj. m < exvi-m 5 / 2 ), j = 1, 2, . . . , j m , < S < 1, 

(2) dist(p(c, 1 , roi p. iro ),p(Ci 2 , ro) P j2!m )) > P jum +E h ,m whe ™P hm = P^r )B \ B, » 1, j = 
1,2,..., j m , provided ji ^ j 2 , 

(3) no determinant f[ a ,m-b] (-e(nmu;), u), E) or f[ a , 2m -b} ('i w ? E)> a = 1,2; b = 0,1; n = 0, has a zero in 

v (Ci,m , Pj,m) \ T> ( Ci,m , P ) , w/iere p = m ' Bl , p 3 , m = p\ 

(4) for each Cj.m there is an integer k(j,m), 

< k{j,m) < v hm{ ., u , E ) \Q,m,P j m ) 
for any z,(eD (Cj >m , 2p^ m ) \ V Kj,m, P jVm / 2 ) 

K-CrMnl 



J = l,2, 



1 Jmj 



suc/i that 



(w m (() - w m (z)) - k(j, m) log 



< IC - «r ' {Pj,m)~ 



\ z Cj,m\ 

Assume that the following condition is valid: 

(II. m) no determinant fi aim -b] (-e(nmw), uj, E) , /[ a ,2m-b]('i w i E) has more than one zero in any disk 
T>(zo,r m ), r m = exp (-(logrn) 4 ) , 20 S ^4. po / 2j n = 0, 1, a =1,2, 6 = 0,1 

Consider two concatenation terms w m (z) and w m (z) with m x exp (to* 51 ), < 5i -C 1. Let I? (Cj,m, Pj,m), 
j = 1,2, . . . ,j m and 2? (Cj,m, Pj.m) , j = 1,2, . . . , j™ be the disks defined in Proposition 16.61 for w m (z) and 
w m (,2), respectively. Note that due to the avalanche principle expansion we can conclude the following: 

Lemma 6.7. There exists !Fm,u,E C A Po with mes Fm.u.E < expf— m 1 / 2 ) smc/i that 



w m (z) — Wm {ze(m — m)uj) 



< exp 



(-in 1/2 ) 



for any z £ A Po / 2 \ J~m,u,E- 

Combining this assertion with the preceding one obtains the following Proposition (using the notations 
of Proposition ^. 6|) . 

Proposition 6.8. Assume that conditions fll.m), (II. m) are valid. Then, using the notations of the previous 
lemma one has 

(0) ifk(ji,m) = 0, then there exists C,j, m 6 T> m >Pj m ) w ^ ^{h m ) = such that 

\w m (z) — Wm {z\e({rn — m)uj)) | < exp ^— m 1 ^ 2 ^ 
for any zeV (c. jum , Ph,m) \ v \Cj,m,P jm ) and any 

,m; Pji ,m) \ »^"m, 

W) B, mes T m .^.E < exp 
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(1) if fc(?'i , m) — 1, then there exists Q,m G 2? (C^ TO ' An.™) k(j,m) — 1 smc/i t/iai 
w m (z) - w m (Ce((m - m)<j)) - log ( , |z ~ c ^ 1 — -) < exp ( -m 1/2 ) , 

\\Ce\(rn-m)u)-Cj, m \J \ ' 

for any zeD ^ _„,> Pn ,m) \ © (Cj.m , ^- m ) , C G 2? (c^ „ , ftx ,™) \ 4, U ,E ■ 

One can iterate the estimates of the previous proposition over a decreasing series of scales. In fact, this 
can be done all the way down to a scale of unit size. In this fashion one arrives at the following: 

Theorem 6.9. Let V(x) be real analytic. Assume L(u>o, E) > 70 > for some lo$ G T c a and all E £ (E' , E") 
and fix b > small. There exist N = Nq(X, V, 70, b, c, a) , t = t (A, V, 70, b, c, a) > so that: 

For any e > 0, there exists 0(e) C T, mes f2(e) < e such that for any u> G (u> — t , u> + t ) n (T c . a \ 0(e)), 
i/iere exists E^ie) C R, mes < e smc/i that for any N > Nq and any E £ (E',E") \ £ w (e) and any 

rj > l/N(logN) 1+b , one has 

(6.112) [ #(sp (H n (x,uj)) n (E- ?],E + rj)) dx < exp ((log e" 1 )^ rjN . 

In particular, the IDS satisfies 

Af(E + 77) - JV(E -r])< exp ((loge- 1 )^) 77 

for any E £ (E r , E") \ £ u (e), V > 0- 

The proof of Theorem 16.91 establishes the estimate (|6.112f) for any E £ M.\£ u (e), with very detailed 
description of £ u {e) as a union of intervals of different scales. This allows one to combine the Lipschitz 
estimate here with the Holder bound of Theorem 16 . 1 II below to prove the following 

Theorem 6.10. For almost all lu £ (luq — tq,ujo + to) the IDS Af(E) is absolutely continuous on (E',E"). 
In particular, if L(u>o, E) > 70 > for all E, thenAf(-) is absolutely continuous everywhere. 

Proof of Theorem WHA Let {E' n ,E'^) y 1 < n < n be disjoint intervals with e = J2(K ~ E 'n) < l - Set 

t = min (E" — E' n ). Let uj r = p r q~ x be a convergent of lu with q r > t~ 4 . Let m^ s \ s = 1, 2, . . . , t + 1 be 

n 

integers such that: (1) log(m( s+1 )) x (m (s) ) A , s = 1,2, ...,t, (2) £ > exp (-m' 1 ') > y^, = q r =: iV. 

Using the Lipschitz estimates of the previous theorem applied to each pair of consecutive scales , m( s+1 ) 
one can obtains 

1 r c 1 1 .. 

/2 



t+1 

r- TH> \ 

, Afl/2 ! JVl/2 

s=l 

the collection of such intervals. Then 



i / #(s P Hs{x, U ) n(j^ £ j^))dx< m VN-W 



for any interval LJ^ , C R \ |j whcrc ^ G Z, provided uj £ Y c . a \ [j fiW. Let {T f : t £ £} be 



1 y # sp H N {x,u) D f |J T< U< m« £ . 

Let £' = {£ G £ : n {K>K' : n = 1, 2, . . . ,n} ^ 0}. Then #£' < 2n. Since n < r" 1 one obtains: 



^y# ^spfr^wjn tM 



dx < m (1) N- 1/2 n < m (1) r . 



Finally, using the Holder bound of Theorem 16.111 as well as the measure and complexity bounds on the 
exceptional sets £^ yields 

i y #^s P H N (x,Lu)n ^(J^^rf a ;<cxp(-(logm( 1 )) A ) 

and we are done. □ 
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Note that the previous proof exploits detailed information on the size and complexity of those sets on 
which the IDS is not Lipschitz (the exceptional set). This is necessary, as can be seen from the example of 
a Cantor staircase function. Indeed, in that case there is a uniform Holder bound with an exponent that 
equals the Hausdorff dimension of the Cantor set. However, in our case the exceptional set has Hausdorff 
dimension zero, whereas the Holder exponent is fixed and positive. 

6.2. The Holder bound. The following result is proved in |GolSch2] . 

ko 

Theorem 6.11. Let Vo(e(x)) = v(k)e(kx) be a trigonometric polynomial, v(—k) = v(k), — ko < k < ko. 

— ko 

Let L(E,ojq) be the Lyapunov exponent for V = Vq and some u>q £ TT ca . Assume that it exceeds 70 for all 
Ee(E',E"). 

(1) Given po > there exists tq — tq(X, Vq, loq, 7o ; Po) with the following property: for any 1-periodic, 
analytic function V(e(x + iy)), —po < y < po assuming real values when y — and deviating from 
Vo(e(x)) by at most To, any ui 6 T c-a n (u>o — tq,uJq +to)> and any E £ (E',E"), with n = N~ 1+s , 
5 -C 1, N > 1, one has 

(6.21) / # (sp (H N (x, to)) n (E - r), E + 77)) dx < r)^~ e ■ N 

with some constant 1 < B and arbitrary e > 0. 

(2) The IDS M(-) satisfies, for any small e > 0, 

Af{E + -q) - Af(E - 77) < rj^~ £ , 
for all E S (E 1 , E") and all small r\ > 0. 

For the case of the almost Mathieu equation l|l.lfl (which corresponds to fco = 1) and large A, Bour- 
gain |Boul| had previously obtained a H61der-(i — e) result for the IDS, which is known to be optimal in 
those regimes, see |Sinlj . See also |Bou2| . 

The proof of Theorem l6.11l is similar to that of Theorem 16 . 91 above . Recall that the latter result exploited 
the fact that as long as we remove all energies E belonging to some bad set the zeros of /jv( z , E) in z 
do not cluster. In fact, any small disk (say of size e~ N ) does not contain more than one zero. 

The logic is that here we can no longer guarantee this separation property of the zeros since we are not 
allowed to remove energies. Nevertheless, we will be able to show that zeros cannot cluster too much in any 
small disk. The argument proceeds by contradiction: If there were too many (in fact, 2deg(V^) + 1 many) 
zeros in a small disk, then we can show that this would have to be the case in a large number of disks of the 
same size (by exploiting the dynamics). Ultimately, this leads to a contradiction due to the fact that the 
determinant /jv cannot have more than 2N deg(V) zeros in total. This fact appears to be of independent 
interest, and is formulated as a theorem in |GolSch2j : 

Theorem 6.12. Using the notations of Theorem \b.ll\ there exists ko(X,V) < 2degVo with the following 
property: for all E £ R, s G Z and oj £ T CM and any Xq £ T there exists s - , s + with \s — s ± \ < exp((log s)* 5 ) 
such that the Dirichlet determinant f[- s - tS +](-,Lu,E) has no more that ko(X,V) zeros in T>(e{xo), ro) , ro x 
exp (—(logs)" 4 ) . 

While wc need to refer the reader to GolSch2 for more details, we do present some basic statements here, 
which elucidate the role of Jensen averages and the avalanche principle in this context. 

Definition 6.13. Let I 3> 1 be some integer, and s £ Z. We say that s is adjusted to a disk T>(zo,ro) at 
scale £ if for all k x I 

Z(f k (-e((s + m)u),uj,E),zo,ro) = V \m\ < CI. 
Consider the avalanche principle expansion of log | /n(z, uj, E) | : 

71—1 71—1 

(6.22) ]og\Mz,u,E + iri)\ = ^ \og\\A m+1 (z)A m (z)\\ ~ £ log || A m (z)\\ + O (exp^ 1 / 2 )) , 

771—1 m — 2 
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for any z S A po /2 \ B E , v ,u, mes B E , v ,u < exp(-£ 1/2 ), where A m (z) = M^{ze[s m uj),uj,E + irj), m 



,.,n-l, A x (z) = M tl {z,u,E) 



1 


A 



A n (z) 



Mi n (ze(s n u>),u!,E), t m = £,m=l,2,. 



,n — 1, 



1 


£ n =£, (n-l)£ + £ = N,£,£~(logN) A , s m = £ £ : 

This expansion allows us to control the number of zeros of the large scale object (in this case /at) by 
means of the number of zeros (or rather, the Jensen averages) of the small-scale objects (here Wj) and vice 
versa. Surprisingly, it turns out that the most effective way to implement this idea is to obtain an estimate 
for the local number of zeros at a smaller scale in terms of the total number of zeros at a larger scale. The 
all-important quadratic (more precisely, super-linear) error estimate here is due to l|5.74|) above. 

Lemma 6.14. Assume that {s mj } 3 jLi is adjusted to 2?(zo,ro) at scale £. Set tuq = 0, rrij Q+ i — n, and 



mj+l 



Wj(z) = log 



n a ^ z 



for any 0<j< jo 



Then 
(6.23) 



J (log\fN(-,u,E)\,z ,ri,r 2 ) - ^ J(wj(-),zo,ri,r 2 ) < N exp((log£) c ) rjr Q 2 



3=0 



for any e ^ < r\ < exp(~(\og£) A )ro, and r 2 = cr\. In particular, 



(6.24) 4^ j(log|/ w (.,w,E)|,zb,ri,ra) > £ j( Wj ■(•), zq, n, r 2 ) - N exp({\og£) c ) r\r^ 2 
for any J C [0, j ] . 

For the remaining details (in particular, the crucial notion of "contributing" terms) we refer the reader 
to |GolSch2| . 

7. Generic C 3 potentials 



In this section we review some recent work of Jackson Chan, see [Cha . Given any function V : T — » 1 
we have a family of quasi-periodic discrete Schrodinger equations 

(7.1) -y{n+ 1) - <p(n- 1) + XV(x + nu)(p(n) = E<p(n), neZ 

where (x, uj) G T x T are parameters. Equation (|7.1fl can be rewritten as a first order difference equation: 



<p(n+ 1) 
(p(n) 



XV(x + nuj) - E -1 
1 



<p(n) 
ip(n - 1) 



Given a C 3 potential V, any C 3 function V satisfying the conditions 

max\V(x)-V(x)\<6 

m&x\V'(x) ~ V'(x)\ < S 

max\V"(x)-V"(x)\<6 



can be written, near x = 0, in the form 



1 



V(x) = V(x) +r) + £x + ^9x 2 + x 3 R(x) 

where \r]\, |£|, \9\ < 6, R £ C 3 , \d a R\ < 1 for any index \a\ < 2. More generally, since T is compact, we can 
find some large integer T so that 

T 



(7.2) V(x) = V(x) + £ [Vm + U(x + \e m (x - + (x - ^fR m (x 

m— 1 



T' 



T ■ 



2G 
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T 

for all x £ T, where rj = (771, . . . , ryr), £ = (6, ■■■,&), = (6>i, • ■ • , <?t) £ EihMi and R ™ e C 3 ' 

1 

\d a R m \ < 1 for any index \a\ < 2. This motivates the following definition. 

Definition 7.1. Let T be a large integer, < 6 <C Suppose R m {rj m ,£ m ,9 m ]x) are C 3 functions, 

3T 

m = 1, 2, . . . , T, (r/, £, 0) e n[~^' ^L- x G satisfying the following conditions: 

l 

|5 a -Rm(r/m,Cm,0m;a;)| ^ y for any index |a| < 3 
i? m (0,0,0;a;) = 

R m (Vm,£ m ,0 m ;x) = -x~ 3 (r) m + £ m x + ^9 m x 2 ) for |x| > — 

Define a (T, o")-variation of the potential by 

T 

w{n,£,6,{R m y,x) = J2 v m( 



m 

mi* 



T 

m— 1 

where 

(^?m ) £m > @m : ^/m "I - £,m x "t - ~^@m x ~1~ -^m (^?m ) £m ) ? 

By the preceding, 

u m (0,0,0;a:) = 

and 

v m {Vm,£m,0 m ;x) = for > — . 

Denote the collection of (T, o~)-variations of the potential by S(T,S). The set of parameters (j],£,9) has 
measure (2<5) 3T . We want to define a notion of "typical" potential by using the normalized measure on this 
set of parameters. Hence, a set S C S(T,S) is called (1 — e)-typical if 

151 ;= {T 1 } (2lW mes { (?7 ' ^ e) G [_<5 ' 5]3T : wiv ' ^ e ' {Rm}; ■) G 5 ) ^ 1 - e 

Theorem 7.2. Given any V £ C 3 (T), t/iere is Ao = Xo{V) such that for |A| > Ao, one has a collection of 
perturbed potentials {S e = St(V, A)}£L 1; S e C S(T^,6 e ), logT^ +1 ) x (T^) a , 

OO 

< a « 1 ; (l — < A _/3 ; so that for any potential 

OO 

V(x) = V(x)+Y,W^(^\^\9^\{R^};x) 

e=i 

where W^' £ Si, there exists = V), mes Q < A - * 3 , so that the Lyapunov exponent L(ui, E) > j log A 
for any u> £ T \ fi, E £ M.. Furthermore, the corresponding eigenf unctions are exponentially localized. 

There are two central technical problems which one has to deal with in order to establish this theorem. 
The first one consists of the splitting of eigenvalues of the problem 17.1fl on a finite interval [— JV, N\. The 
technology for this splitting developed in |GolSch2] for the case of an analytic potential can be modified for 
a "generic" smooth potential. 

Proposition 7.3. Using the notation of Theorem \ 7.2\ there exist integers T' s , logT s ' X logT^ s ^, such that 



for any nested sequence of intervals J~ s ,k s — [^S )> an d x £ T, lo £ T \ fl, there is a sequence of integers 
{N s = N s (x,uj)}, with \ogN s X logT s ', so that 

(7.3) \Ei - E 2 \ > exp(-iVJ) 

for distinct eigenvalues E\, E 2 £ (sp H\_n b ,n, 
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The second problem is as follows. The eigenvalues of the problem (1.1) on a finite interval [1,N] have a 
parametrization Ei(x) < E2(x) < . . . < Ejy(x), x £ T which are as smooth as the potential V(x). This 
general result is due to the self adjointness of the the problem (|7.1|) and nondegeneracy of the the eigenvalues 
of l|7.1|l restricted on a finite interval. The problem is how to evaluate the quantity 

(7.4) la^i + ia^i 

from below. 

This problem was also studied in [GS2]; for analytic potentials, the problem was solved using discriminants 
of polynomials and Sard-type arguments. This method has no modification for smooth potentials. This is 
the very problem for which the variations of the potential were introduced. The most basic idea of the 
method of |Cha| is as follows. 

"Typical" C 3 functions F(x) are Morse functions, i.e., the quantity 

(7.5) \d x F\ + \d xx F\ 

has a good lower bound, gauged according to the size of F. On the other hand, there is a basic relation 
between d x Ej and the potential V(x): 

N 

(7.6) d x Ej = £ V'{x + ku)\<pj(x)(k)\ 2 

where ipj(x)(.) is a normalized eigenfunction of (|7.1(l on the interval [1,N] corresponding to Ej(x). The 
relation 17.6(1 enables one to express the "genericity" of the potential V in terms of a lower bound, provided 
tpj(x)(.) is exponentially localized. Ultimately, the bad cases can be eliminated by varying the frequencies u>. 
The Sard-type arguments allow one to show that the total measure of those u> for which there is no response 
in H7.6fl under the variations of V is extremely small. 
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